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Abstract 

Stackelberg security game models and associated computational tools have seen de¬ 
ployment in a number of high-consequence security settings, such as LAX canine patrols 
and Federal Air Marshal Service. These models primarily focus on isolated systems where 
only one defender is present, despite being part of a more complex system with multi¬ 
ple players. Furthermore, many real systems such as transportation networks and the 
power grid exhibit interdependencies between targets and, consequently, between decision 
makers jointly charged with protecting them. In order to understand such multi-defender 
strategic interactions present in security, we investigate game theoretic models of secu¬ 
rity games with multiple defenders. Unlike most prior analysis, we specifically focus on 
the situations in which each defender must protect multiple targets, so that even a single 
defender’s best response decision is, in general, highly non-trivial. We start with an an¬ 
alytical investigation of multi-defender security games with independent targets, offering 
an equilibrium and price-of-anarchy analysis of three models with increasing generality. In 
all models, we find that the defenders have the incentive to over-protect the targets, at 
times significantly. Additionally, in the simpler models, we find that the price of anarchy is 
unbounded, linearly increasing both in the number of defenders and the number of targets 
per defender. Considering interdependencies among targets, we develop a novel mixed- 
integer linear programming formulation to compute a defender’s best response, and make 
use of this formulation in approximating Nash equilibria of the game. We apply this ap¬ 
proach towards computational strategic analysis of several models of networks representing 
interdependencies, including real-world power networks. Our analysis shows how network 
structure and the probability of failure spread determine the propensity of defenders to 
over- or under-invest in security. 

1. Introduction 

Security, physical and cyber, has come to the forefront of national attention, particularly 
after 9/11. Among the variety of approaches that are used to tackle security problems, 
from risk analysis to red teaming, game theory has had a significant impact, with tools 
based on game theoretic analysis having been deployed in LAX airport to schedule canine 
patrols (Paruchuri et ah, 2008; Jain et ah, 2008; Pita et ah, 2009), by Federal Air Marshall 
Service (FAMS) to schedule the air marshals (Kiekintveld et ah, 2009; Jain et ah, 2010, 
2010), and by the US Coast Guard to schedule boat patrols (Shieh et ah, 2012). All of these 
deployments, and numerous other related efforts, have cast security as a Stackelberg game 
between a single defender and an attacker, in which the defender leads (i.e., acts first), 
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choosing a probability distribution over defense actions, and the attacker, upon learning 
this probability distribution, chooses a response (Conitzer & Sandholm, 2006). In many 
cases, the attacker is modeled as a rational agent who selects an optimal response and, in 
the many applications that compute a Strong Stackelberg equilibrium, an attacker is often 
assumed to break ties in the defender’s favor (Paruchuri et ah, 2008; Korzhyk et ah, 2011). 

A crucial assumption that all these efforts have in common is that a single defender is 
responsible for all the targets that need protection, and that she has control over all of the 
security resources. However, there are many domains in which there are multiple defender 
agencies who are in charge of different subsets of all targets. In practice, numerous parties 
are responsible for security; indeed, the fact that the basic framework has been deployed 
by different entities and agencies makes this manifest already. If security decisions made 
by different parties were entirely independent, both from the defender’s and the attacker’s 
perspective, a single-defender model would be entirely satisfactory. However, the assets 
protected by different entities are typically interdependent, or, more generally, have value 
to others who are not involved in security decisions. Additionally, attackers, insofar as they 
may target different sectors under the charge of different defenders, are resource constrained, 
implicitly coupling otherwise independent targets. 

An important motivating application for our multidefender security game is security and 
reliability in the power grid. Independent System Operators (ISOs) and profit-driven inde¬ 
pendent utility operators are largely responsible for operating and controlling subsystems 
of the entire grid (Lazar, 2011). These operators are held responsible for the reliabil¬ 
ity of their system, and thus have independent, and possibly even competing, goals with 
neighboring ISOs. As such, their security decisions are made independently, despite the 
interdependencies present between subsystems. As a result of this organization, cascading 
failures in the power grid can present a great threat to the entire system, even when an 
explicit attacker is not present. This problem is exacerbated by the fact that components in 
the grid are controlled by multiple entities and are also dependent on other independently 
operated utility networks (water, communications, natural gas, etc.). 

We extend the previous Stackelberg game models in two ways: 

1. an analytic equilibrium and price of anarchy (PoA) characterization of 3 multidefender 
security scenarios, in which we assume homogeneous and independent valuations of 
the targets for each defender; and 

2. a computational analysis leveraging a novel mixed-integer linear program (MIP) ap¬ 
proach for computing a defender’s best response, combined with a novel heuristic 
method for approximating equilibria in interdependent multi-defender security games 
with heterogeneous targets. 

In case 1 where there are multiple defenders, and the values of the targets are independent 
and homogeneous among the defenders, our analysis is focused on three models of such 
multi-defender games (each varying in their level of generality). We show that a Nash 
equilibrium among defenders in this two-stage game model need not always exist, even 
when the defenders utilize randomized strategies (i.e., probability distributions over target 
protection levels); this is distinct from a model in which the attacker moves simultaneously 
with the defenders, where a mixed strategy equilibrium is guaranteed to exist. When 
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an equilibrium does exist, we show that the defenders protect all of their targets with 
probability 1 in all three models, whereas the socially optimal protection levels are generally 
significantly lower. When no equilibrium exists, we characterize the best approximate Nash 
equilibrium (that is, one in which defenders have the least gain from deviation), showing that 
over-investment is substantial in this case as well. Our price of anarchy (PoA) analysis, 
which relies on the unique equilibrium when it exists, and the approximate equilibrium 
otherwise, demonstrates a surprising finding: whereas PoA is unbounded in the simpler 
models, increasing linearly with the number of defenders, the more general model shows 
this to be an atypical special case achieved when several parameters are exactly zero. More 
generally, PoA is bounded by a constant. 

For case 2, we introduce interdependencies between targets. Because closed-form anal¬ 
ysis in this setting is intractable, we propose a novel mixed-integer linear programming 
approach combined with a novel heuristic method to approximate equilibrium behavior. 
Unlike other multi-defender models (e.g., (Kunreuther & Heal, 2003; Chan et ah, 2012; 
Bachrach et ah, 2013; Acemoglu et ah, 2013)), our approach maintains the typical com¬ 
plexity of individual defender decision process in the multi-defender framework, with each 
defender responsible for securing many^ possibly interdependent, targets. Our setup gives 
rise to two competing externalities of security decisions: a positive externality, where greater 
security implies reduced contagion risk to other defenders, and a negative externality, which 
arises because high security by one player pushes the attacker to attack someone else’s as¬ 
sets. We study the impact of competing externality effects of defense on the resulting 
Nash equilibrium outcomes as a function of network topology (using both synthetic and 
real networks), interdependent risk, and the level of system decentralization. One of our 
key findings is that the impact of system decentralization on security and welfare can be 
non-monotonic when systems are highly interdependent: high levels of decentralization can 
yield near-optimal outcomes, even as moderate decentralization results in significant under¬ 
investment. With weak interdependencies, on the other hand, an increasingly decentralized 
system tends more strongly to over-invest in security. 

The remainder of our paper is outlined as follows. In Section 2, we give an overview 
of related work. In Section 3, we briefly outline the definitions and solution concepts of 
our independent and interdependent multi-defender security games, respectively. Section 4 
provides an equilibrium and PoA analysis of the homogeneous, independent security game 
models. Section 5 further explains the interdependent multi-defender model, and presents 
results on well-studied synthetic networks, as well as on real-world power grid networks. 

2. Related Work 

Our work, like much work in the recent security game literature, builds on the notion 
of Stackelberg games (Osborne & Rubenstein, 1994), which model commitment in strate¬ 
gic settings. The first thorough computational treatment of randomized (mixed strategy) 
commitment was due to Conitzer and Sandholm (2006). In this line of work, of greatest rel¬ 
evance to our effort are multiple-leader Stackelberg games (Sherali, 1984; DeMiguel & Xu, 
2009; Leyffer & Munson, 2007; Rodoplu & Raj, 2010; Kulkarni & Shanbhag, 2014; Sinha 
et ah, 2014). In many cases, these approaches leverage specialized problem structure, and 
are not immediately applicable to our setting. In particular, Sherali (1984) and DeMiguel 
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and Xu (2009) focus on relatively simple models with firms setting production quantity (a 
single variable), aiming to maximize profit. Both show existence and uniqueness of equi¬ 
libria in their setting, and leverage these characterization results to obtain solutions to the 
games. Similarly, Rodoplu and Raj (2010) consider a relatively simple model of network 
competition in which leaders are nodes setting prices for packets transmitted through them; 
again, each leader only sets a single variable, the utility functions are problem-specific, and 
algorithms are specialized to the particular problem structure (and are inapplicable to our 
setting). 

Sinha et al. (2014) propose an evolutionary algorithm for solving bi-level Stackelberg 
problems, but their problem structure is also highly specific to the domain of interest (firms 
choosing production, investment, and marketing, and maximizing profit), and the evo¬ 
lutionary algorithm leverages significant simplifications, such as the assumption that the 
market eventually clears. Leyffer and Munson (2007) present a very generic multi-leader 
multi-follower setting and solution framework in the context of shared complementarity 
constraints (which is the case for our problem, where a single follower attacks a single tar¬ 
get), but rely on separability of objective functions in leader and follower variables, the 
assumption that does not obtain in our setting (in addition, their approach only scales to 
2-4 leaders, whereas we are able to approximately solve games with 64 leaders). Kulkarni 
and Shanbhag (2014) offer a deep theoretical treatment of a relatively broad class of multi¬ 
leader multi-follower games, but much of their analysis and positive results are restricted 
to potential games, and they do not offer specific algorithmic suggestions. Like us, they 
leverage shared constraints to resolve the issue of incompatible leader assumptions about 
the follower tie-breaking behavior. 

Our point of departure is a class of Stackelberg games specifically pertinent to security: 
commonly, these are simply known as security games (Korzhyk et ah, 2011; Paruchuri et ah, 
2008; Jain et ah, 2010; Vorobeychik et ah, 2011). In these games, a single defender allocates 
a set of resources among potential targets of attack in a randomized fashion (that is, the 
defender commits to a probability distribution over resource-to-target mappings), with an 
attacker choosing a single target to attack after observing the defender’s strategy. Almost 
universally in this domain, a Strong Stackelberg equilibrium (SSE) is a solution concept of 
choice. In SSE, the follower (attacker) is assumed to break ties in the defender’s favor. As 
we will see below, this solution concept presents conceptual and technical problems in a 
multi-defender setting. 

A similar, though mostly orthogonal, line of work are network interdiction problems (Cormi- 
can et ah, 1998; Woodruff, 2003), in which a leader attempts to interdict a network over 
which the follower subsequently solves a variation of a network flow problem. Unlike the 
literature on security games, as well as our setting, network interdiction problems are almost 
universally zero-sum (minimax). 

Another somewhat related line of work considers the problem of coordination and team¬ 
work among multiple defenders in a purely cooperative setting (Jiang, Procaccia, Qian, 
Shah, & Tambe, 2013; Shieh, Jiang, Yadav, Varakantham, & Tambe, 2014a). This work, 
however, is entirely unlike ours: in particular, our primary focus is on the impact of in¬ 
centive misalignment among the defenders with different (though certainly related) moti¬ 
vations, rather than coordination issues and teamwork. While often effective coordination 
among multiple defenders can be achieved, just as often (if not predominantly) decentral- 
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ization of decision making processes and resources inherently give rise to distinct, and often 
conflicting, incentives among defenders. 

Among the earliest multi-defender models is the literature on interdependent seeurity 
games (Kunreuther & Heal, 2003), in which interactions among multiple defenders are mod¬ 
eled as an n-player, 2-action game, where a player decides whether to invest in security; 
however, no attacker is considered. More recently, time-dependent scenarios where coor¬ 
dination of defender resources amongst multiple defenders is assumed have been studied 
using Markov decision processes (Shieh et ah, 2014b). Since total cooperation is assumed, 
this model effectively reduces to a single defender game in which the defender controls all 
resources. A recent extension, interdependent defense games (Chan et ah, 2012), does con¬ 
sider an attacker who acts simultaneously with the defenders, rather than after observing 
the joint defense configuration, as in our model. Interdependent defense games have also 
been studied in the context of traffic infrastructure defense (Alderson et ah, 2011). Two re¬ 
cent efforts studying multi-defender games explicitly model interdependence among targets 
through a probabilistic contagion process (Bachrach et ah, 2013; Acemoglu et ah, 2013). 
Like our paper, they consider attackers who observe the joint defense prior to making a 
decision, but each defender is restricted to secure a single node, and strategy space is as¬ 
sumed to be continuous. Vorobeychik et al. (2011) is, to our knowledge, the only other 
attempt to study strategic settings related to security in which each player’s decision space 
is combinatorial. However, this work does not consider a strategic attacker. 

3. Multidefender Models 

Our modeling effort proceeds in four steps, each generalizing the previous. As we see be¬ 
low, each generalization step reveals new and surprising insights about the multi-defender 
security setting, allowing us to appreciate the fundamental incentive forces. The first three 
models deal with homogeneous, independent targets and will be analyzed exactly using 
Nash equilibrium and price of anarchy (PoA) analysis. Our final model introduces interde¬ 
pendencies, and will be analyzed using computational methods. 

3.1 The Baseline Model 

We start with a model which most reflects the related literature: in particular, this model 
involves n defenders and a single attacker, with each defender engaged in protecting a 
single target. Each target has the same value to the defender u > 0. We suppose that 
the defender has two discrete choices: to protect the target, or not. In addition, the 
defender can randomly choose among these; our focus is on these eoverage probabilities (i.e., 
the probability of protecting, or covering, the target), which we denote by qi for a given 
defender i. The attacker is strategic, can observe the defenders’ coverage probabilities, 
and chooses a target that maximizes the damage. We assume that attacker is indifferent 
among the targets, and attacks the target with the lowest coverage probability, breaking 
ties uniformly at random. In a given scenario, for all defenders, the attacker’s strategy is 
a vector of probabilities P —< Pi->P2-> >•> where pi is the probability of attacking the 

target protected by defender i, with ~ 1* 

We assume that if the attacker chooses to attack a target corresponding to defender 
i and defender i chooses to protect the target, then the utility of both is 0, and if the 
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attacker attacks the target but it is not protected, then the utility of the defender is —v 
while the attacker’s utility is v. If a defender chooses to cover a target, it will incur a cost 
c > 0. Additionally, we assume that targets are independent, i.e., if defender i is successfully 
attacked, all other defenders j ^ i receive 0 utility. We can thus define the expected utility 
of a defender i as 

Ui ^PiUi + (1 -Pi)u^, 

where is the utility of i if it is attacked, and is the utility of i if it is not attacked. By 
the assumptions above, 

< = -(1 - qi)v - qiC = -v + qi{v - c) 
uf = -qiC. 


3.2 The Multi-Target Model 

Our key conceptual departure from related work is in allowing each defender to protect 
multiple targets, aligning it better with practical security domains. Specifically, suppose 
that there are n defenders, each protecting A; > 1 targets. Then the strategy of defender i 
will be a vector < qn^ qi 2 ^ ...qik >• The strategy profile of the attacker can then be described 
as a matrix of probabilities, 


(pll 

Pl2 . 

• • pik\ 

P21 

P22 • 

■ ■ P2k 

\Pnl 

Pn2 . 

Pnk) 


in which Yl^j=iPi 3 — 1 Pij ^ 0 each i and j. The expected utility of a defender 
i in this model is 

k 

Ui = '^PijKj + {1-Pij)u^p 

where is the utility of target j to defender i if it is attacked, and is the utility of 
target j to i if it is not attacked. Using the notation introduced earlier, we have 

Uij = -(1 - qij)v - qijC = -v + qij{v - c) 

'^ij = -QijC- 


3.3 The General Model 

Generalizing further, we assume that if the attacker chooses to attack a target protected 
by defender i and the defender chooses to protect the target, then the utility of the target 
to defender i is and if the attacker attacks the target but it is not protected, then the 
utility of the target to the defender is U'^. It is reasonable to assume that > U^. If 
the target of defender i is not attacked, then we assume that the utility of the target for 
defender i is Q > U^. Other assumptions are the same as those in the multi-target model. 
In the general model, therefore, 

= qijU^ + (1 - - qijc = {u^ -u^- c)qij + 

~ ^ Qij^- 
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3.4 The Interdependent Model 

The previous three models featured three important restrictions: first, that target values 
are homogeneous, second, that targets are independent, and third, that defenders protect 
the same number of targets. We now relax these restrictions. Suppose that a defender i can 
choose from a finite set O of security configurations for each target j ^ Ti^ where Ti is the 
set of targets under i’s direct influence. Let T be the set of all targets, that is, T = U^T^, 
with |T| = m. A configuration o G O for target j G Ti incurs a cost to the defender 
i. If the attacker attacks a target j ^ T while configuration o is in place, the expected 
value to a defender i is denoted by while the attacker’s value is We assume in 
this model that each player’s utility depends only on the target attacked and its security 
configuration (Kiekintveld et ah, 2009; Letchford & Vorobeychik, 2012). We denote by 
the probability that the defender i chooses o at target j ^ Ti. 

While the problem we study assumes that that the utility of any player for a given 
target depends only on its security configuration o, there is a rather natural way to model 
interdependencies while retaining this structure, proposed by Letchford and Vorobeychik 
(2012). Specifically, suppose that dependencies between targets are represented by a graph 
(T,L^), with T the set of targets (nodes) as above, and E the set of edges (j,/), where 
an edge from j to / means that a successful attack on j may have impact on j'. Each 
target j has associated with it a value, for the defender i, which is the loss to i if j is 
affected (e.g., compromised, broken). The security configuration determines the probability 
Zj{j) that target j is affected if the attacker attacks it directly and the defense configu¬ 
ration is o. We model the interdependencies between the nodes as independent cascade 
contagion (Kempe et ah, 2003; Letchford & Vorobeychik, 2012). The contagion proceeds 
starting at an attacked node j, affecting its network neighbors / each with probability r^j/, 
the contagion then spreads from the newly affected nodes f to their neighbors, and so on. 
The contagion can only occur one time along any network edge, and once a node is affected 
it stays affected through the diffusion process. Each player’s valuation for each target is 
then updated based on the probability of a failure cascading to one of the player’s owned 
targets. 

3.5 The Weakness of Strong Stackelberg Equilibrium 

By far the most important solution concept in Stackelberg security games is a Strong Stack¬ 
elberg equilibrium (SSE). A SSE is characterized by an assumption that the attacker breaks 
ties in defender’s favor. When there is a single defender, this is well defined, and quite 
reasonable when the defender can commit to a mixed strategy: a slight adjustment in the 
defense policy will force the attacker to strictly prefer the desired option, with little loss to 
the defender. As we now illustrate, however, SSE is fundamentally problematic in a multi¬ 
defender context, because the notion of “breaking ties in defender’s favor” is no longer well 
defined in general, as we must specify which defender will receive the favor. 

To see concretely what goes wrong, consider the example in Figure 1. In this example 
there are two defenders, one who defends the target on the left, while the other defends 
the target on the right. Both defenders value their respective targets at 1, and have no 
value for the counterpart’s target. The cost of defending each target is 0 < 5 <C 1. Now, 
consider a strategy profile in which qj — 0 for both targets t, and let us focus on the best 
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cost = £ 


q=0 q=0 


Figure 1: Example of a problem with a SSE in a multi-defender setting. 

response of the first (left) defender. If this defender attempts to compute an SSE by fixing 
the strategy of the second player, he perceives his utility under the current strategy profile 
to be 1, since he would assume that the attacker breaks ties in his favor and, thus, attacks 
the defender on the right. By the same logic, the defender on the right will assume that 
the attacker will attack his counterpart, and perceive qj = 0 to be the best response. Since 
the attacker actually attacks one of them, the best response of the defender being attacked 
is to defend with a small probability, pushing the attacker towards the other target. What 
goes wrong here is that both players assume that the attacker attacks the other (breaks ties 
in their favor), which is inconsistent with the assumption that the attacker will certainly 
attack some target.^ 

3.6 Solution Concepts 

Since the classic (two-player) SSE solution concept used in Stackelberg security games does 
not conceptually extend to be an individual defender best-response problem in the multi¬ 
defender setting, we need to consider an alternative. One option is to compute an arbitrary 
subgame perfect equilibrium. However, we wish to impose a natural constraint on the 
solution concept that the attacker’s best response be computed consistently for any joint 
defense policy, just as it is in a SSE (in other words, we wish to fix a tie-breaking rule). One 
natural tie-breaking rule is that the attacker chooses a target uniformly at random from the 
set of all best responses. We call the corresponding solution concept (which is a refinement 
of the subgame perfect equilibrium of our game) the Average-case Stackelberg Equilibrium 
(ASE). The crucial property of this solution concept that we desire is that the attacker’s 
behavior presumed by a defender’s best response problem is independent of that defender’s 
identity, a property that SSE violates. As we demonstrate below, ASE is not guaranteed 
to exist, in which case we focus on e-equilibria, in which no defender gains more than e by 
deviating; in particular, we will consider e-equilibria with the smallest attainable e. 

To measure how the efficiency of the game degrades due to selfish behavior of the 
defenders, we consider Utilitarian Social Welfare and (e)-Price of Anarchy in our paper. 
Utilitarian Social Welfare is the sum of all defenders’ payoffs. For the smallest attainable 


1. The problem we observe is similar to the issue of inconsistent conjectures the leaders in a multi-leader 
Stackelberg game could have about follower behavior noted by Kulkarni and Shanbhag (Kulkarni & 
Shanbhag, 2014). The solution we propose below—ASE—has the effect of imposing a shared constraint, 
the idea introduced by Kulkarni and Shanbhag generically. We note, however, that our concern here 
is not merely the fact that inconsistent conjectures lead to disequilibrium; rather, they could lead to 
nonsensical equilibria! 
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6, we define e-Price of Anarchy (e-PoA) as follows: 


e-PoA = 


SWo 

SWe 


where SWq is the optimal (utilitarian) social welfare that can be obtained (i.e., if there was 
a single defender), and SWe is the worst-case (utilitarian) social welfare in e-equilibrium. 
An underlying assumption of this definition is that the value of SWq and SWe are both 
positive. If they are both negative, then e-PoA will be the reciprocal of above equation. 
Note that the ordinary Price of Anarchy is a special case of e-Price of Anarchy with e = 0. 


4. Equilibrium Analysis of Independent Multi-Defender Security Games 

In this section, we consider scenarios in which the values of the targets are independent 
and homogeneous among the defenders. Our equilibrium and Price of Anarchy analysis will 
show that a Nash equilibrium among defenders in the two-stage game model (equivalently, 
ASE) need not always exist, even when the defenders utilize randomized strategies (i.e., 
probability distributions over target protection levels). For cases when there is no Nash 
equilibrium, we make use of approximate Nash (ASE) equilibrium and the associated (e)- 
Price of Anarchy. 

4.1 The Baseline Model 

Our first result presents necessary and sufficient conditions for the existence of a Nash 
equilibrium among defenders in the baseline model, and characterizes it when it does exist. 

Theorem 1. In the Baseline model, Nash equilibrium exists if and only if u > c. In this 
equilibrium all targets are protected with probability 1. 

Proof. Firstly, we claim that Nash equilibrium among defenders can appear only if all tar¬ 
gets have the same coverage probability q. Otherwise, some defender j who has probability 
0 of being attacked has the incentive to decrease her qj. To find the Nash equilibria, we 
need only consider strategy profiles in which all targets have the same coverage probability. 

When all defenders use the same coverage probability g, each defender’s expected utility 
is 

(v — cn)q — V 

u — -. 

n 

If g < 1, some defender i could increase q to q + 6^ where 5 is a small positive real number, 
to avoid being attacked, and receive utility u' — —{q + 5)c, so that 

/ v(l — q) — nc6 

u — u — -. 

n 

As S can be arbitrarily small, N — u > 0 when q < 1. Consequently, when g < 1, a defender 
always has an incentive to deviate, which implies that the only possible Nash equilibrium 
can be for all players to play q = 1. 

When all defenders use coverage probability q = 1, each earns an expected utility of 

u = —c. 
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If a defender i decreases her coverage probability to g' < 1, then she will be attacked with 
probability 1, and receive expected utility v! — —v + q\v — c)^ so that 

u — u = {v — c){q — 1). 


If u > c, then u' — u < and q — 1 is indeed a Nash equilibrium. If u < c, however, 
u' — u > 0^ which implies that a Nash equilibrium does not exist. □ 


Thus, if a Nash equilibrium does exist, it is unique, with all defenders always protecting 
their targets. But what if the equilibrium does not exist? Next, we characterize the (unique) 
e-equilibrium with the minimal e that arises in such a case. We will use this approximate 
equilibrium strategy profile as a prediction of the defenders’ strategies. 


Theorem 2. In the Baseline model, if v < c, the optimal e-equilihrium is for all defenders 
to cover their target with probability The corresponding e is . 

Proof. We firstly consider strategy profiles in which all targets have the same probability q 
of being protected. Then each defender’s expected utility is 

iv — cn)q — V 

u — -. 

n 

Suppose 0 < q < 1. If a defender i slightly increases q to q + 6i^ she could receive a 
utility u' — — (g + 5i)c, with 


u 


// 


, v(l — q) — nc 5 i vil — q) 

u — u = - < -. 

n n 

Suppose 0 < q < 1. If a defender i slightly decreases q to q — 62 ^ she could receive utility 
= —V + {q — S 2 )(v — c), with 


Since S 2 < q, 


u 


— u — 


v{l — q){l — n) + 62n{c 
n 


v) 


„ ^ vil - q){l - n) + qnic-v) v 

u — u < - = — 

n 

Let di — , d 2 — + (^c — v). If g = 0, a defender could deviate to increase utility 

by at most If g = 1, a defender could deviate to increase utility by at most {c — v). 
When t) < q < we have that d 2 < di^ and it is di-equilibrium. When ^ < g < 1, we have 
that d 2 > di^ and it is d 2 -equilibrium. 

Putting everything together, there is an e-equilibrium with 


( 1 -^) 


n 


+ {qc - v). 


e 


n 

vp-q) 

n 


+ {qc-v), 


if0<g< f; 
if ^ < g < 1. 


Moreover, q = ^ gives us the unique minimal e = when all defenders use the same 

coverage probabilities. 
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We now claim that the -equilibrium could only exist when all defenders play an 

identical coverage probability. Suppose defenders use different coverage probabilities. Then 
there are a defenders for 1 < a < n who have the same minimal probability q' of protecting 
their targets. The expected utility for each defender among these a defenders is: 

(v — ca)q' — V 

Uq — . 

(T 

When ^ < q' < some defender i among these a defenders could decrease her probability 
to 0 to get a utility of ui — —u, with 


v{l - q') , v{l 

Ui — Ue = - - + {q c — v) > - 


a 


n 


q') , f . v(c — v) 

^ + {qc -v)> ^ ^ 


cn 


When 0 < q' < some defender i among these a defenders could increase her probability 
to q' + ^3 to get utility U 2 = —{q' + 5s)c with 


v(l — q') — ac6s v(l — q') vie — v) 

U2 R/g ^ ^ , 

a n cn 

where the first inequality follows because ^3 can be made arbitrarily small. 


□ 


Armed with a complete characterization of predictions of strategic behavior among the 
defenders, we can now consider how this behavior related to socially optimal protection 
decisions. Since the solutions are unique, there is no distinction between the notions of 
price of anarchy and price of stability; we term the ratio of socially optimal welfare to 
welfare in equilibrium as the price of anarchy for convenience. 

First, we characterize the socially optimal outcome. 


Theorem 3. In the Baseline model, the optimal social welfare SWq is 


SWo 


—cn^ if V > cn; 
—u, if V < cn. 


Proof sketch. We firstly claim that we could get optimal social welfare only if all defenders 
use the same coverage probability q. If their coverage probabilities are different, and some 
defender j has probability 0 of being attacked, we could decrease qj to improve social welfare. 
Therefore we need only to consider identical coverage probabilities q in determining optimal 
social welfare. Welfare, as a function of symmetric coverage q is 


SW{q) = —V + q{v — c) + {n — 1)(—c) — {v — cn)q — v. 


When V > cn^ q = 1 is optimal, whereas g = 0 is optimal otherwise, giving the desired 
result. □ 


From this result, it is already clear that defenders systematically over-invest in security, 
except when values of the targets are quite high. This stems from the fact that the attacker 
creates a negative externality of protection: if a defender protects his target with higher 
probability than others, the attacker will have an incentive to attack another defender. 
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Figure 2: Price of Anarchy when v > c 


In such a case, we can expect a “dynamic” adjustment process with defenders increasing 
their security investment well beyond what is socially optimal. To see just how much the 
defenders lose in the process, we now characterize the price of anarchy of our game. 

If u > c, social welfare in the unique equilibrium with q — 1 is 


SWe = -cn. 


The associated PoA is then 


PoA=\^’ 

I if c < u < cn. 

Figure 2 shows the relationship among PoA, the number of defenders, and the ratio 
From the figure we can see that when number of defenders and - are small (e.g. n < 5 and 
^ = 0.2), the price of anarchy is close to 1. Otherwise, PoA is unbounded, growing linearly 
with n. 

When u < c, there is no Nash equilibrium. However, the optimal e-equilibrium features 
all defenders with the same coverage probability ^ for their targets. The corresponding 
Social Welfare is 

SWe ^ (v - cn)- - V, 
c 

and the associated ^^^~^^ -PoA is ^ which is, again, linear in n. 

4.2 The Multi-Target Model 

Armed with observations from the model with a single target for each defender, we now 
extend the model to a case not as yet considered in the literature in a theoretical light: 
each defender protects a set of k targets. This gives rise to a combinatorial set of possible 
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Figure 3: e value when u = 10, c = 1 


decisions for each defender, so that even computing a best response is not necessarily easy. 
Remarkably, we are able to characterize equilibria and approximate equilibria in this setting 
as well. The proofs for this subsection are in the appendix. 

Our first result is almost a mirror-image of the corresponding result in the baseline 
model: when a Nash equilibrium exists, all defenders protect all of their targets with prob¬ 
ability 1. 

Theorem 4. In the Multi-Target model, Nash equilibrium exists if and only if u > kc. In 
this equilibrium all targets are proteeted with probability 1. 

Next, we consider scenarios when v < kc^ in which there is no Nash equilibrium. Our 
next result characterizes optimal (lowest-e) approximate Nash equilibria. 

Theorem 5. In the Multi-Target model, if v < kc, then in the optimal e-equilibrium all 
targets are proteeted with probability The eorresponding e is 

Thus, as n increases, the optimal approximate equilibrium approaches a Nash equi¬ 
librium. Figure 3 illustrates the relationship between e and the number of targets each 
defender protects when u = 10 and c = 1. In this figure, e = 0 when k < 10, which means 
that an exact Nash equilibrium exists; e increases with k when > 10, but at a decreasing 
rate, converging to ^ as A: ^ oc. 

Finally, we characterize socially optimal welfare, and, subsequently, put everything to¬ 
gether in describing the price of anarchy. 

Theorem 6. In the Multi-Target model, the optimal soeial welfare SWq is 


SWo 


—cnk, ifv> cnk; 
—u, if V < cnk. 


Thus, just as in the baseline case, the defenders will generally over-invest in security. 
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Figure 4: (Approximate) Price of Anarchy when ^ = 0.1 


If V > kc^ there is a unique Nash equilibrium with all targets protected with probability 
1. The corresponding social welfare is 


SWe = -cnk 


Because it is the only Nash equilibrium, the Price of Anarchy is 

_ , I 1, if u > cnk: 

PoA = < \ 

I if ck < V < cnk. 

If V < kc^ there is no Nash equilibrium. The optimal approximate equilibrium features 
identical coverage probability of ^ for all targets. The corresponding Social Welfare is 

SWe = (u — cnk )^— u, 
kc 

and the associated ^^^^^^-Price of Anarchy is n + 1 — Clearly, in either case, and just 
as in the baseline model, the price of anarchy is unbounded, growing linearly with n. 

We now consider how PoA changes as a function of fc, i.e. the number of targets each 
defender has. When A: < ^, a Nash equilibrium exists and the PoA is 1; when ^ < A: < ^, 
PoA increases linearly in k with slope However, when A: > ^, a Nash equilibrium does 
not exist and the approximate PoA is n + 1 — which increases very slowly with A;, and is 
bounded by n + 1 when k ^ oo. Figure 4 illustrates the relationship between (approximate) 
Price of Anarchy and A; for n = 2. When k is very small, PoA = 1. For intermediate A;, 
PoA increases linearly, and when k is sufficiently large, Nash equilibrium no longer exists, 
and e-PoA increases quite slowly, converging to 3 when k ^ oo. 
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4.3 The General Model 

Both the baseline and the multi-target models made rather strong assumptions about the 
structure of the utility functions of the players. In the general model, we relax these 
assumptions, allowing for arbitrary utilities for the players when the target is attacked or 
not, and when it is protected or not (when attacked). Quite surprisingly, our findings here 
are qualitatively different: the special case of the baseline and multi-target models turns 
out to be an exception, rather than the rule when more general models are considered. 
Just as before, we start by characterizing Nash and approximate Nash equilibria. 

Theorem 7. In the General model, Nash equilibrium exists if and only if — U'^ > 
_ {n-i){n-U ) ^ equilibrium all targets are proteeted with probability 1. 

Proof. We firstly claim that Nash equilibrium can appear only if coverage probabilities of 
all of targets Uj are identical. Otherwise, there will be a target fk which has the probability 
0 of being attacked, and the defender i has an incentive to decrease qik. To determine a 
Nash equilibrium, we therefore need only consider scenarios in which all targets have the 
same coverage probability. 

When all targets have the same coverage probability q to be protected, the utility of 
each defender is 

me _ ^ 7 ^ _ nkc)q + U^ + ink - 1)5^ 

u =-. 

n 

li q < 1, then some defender i could increase q to q + 5 ioi all of her targets to ensure none 
of them are attacked, and obtain utility of N — kPt — k{q + J)c, so that 

, (U^ - U^)(l -q) + (n- U^) - nkc6 

u — u — -. 

n 

As and 6 can be arbitrarily small, u' — u > 0 when ^ < 1, which means 

that this cannot be a Nash equilibrium. Thus, the only possible equilibrium can be qij — 1 
for all targets pj. 

When all targets have the same coverage probability q — each defender’s utility is 

— nkc + ink — 

u = -. 

n 

We claim that if a defender i has an incentive to deviate, it is optimal for this defender 
to use the same coverage probability for all her targets. Otherwise, for some target tik 
which has probability 0 of being attacked, she could decrease to obtain higher utility. 
If probabilities of targets protected by defender i are all q' (0 < g' < 1), then her expected 
utility is u' = — U'^ — c)q' + + {k — — g'c), and 

u' -U^{W-U^- kc){q' - 1) + 


We therefore have two cases: 

1) If > kc^ then ix' — ix < 0, and q — 1 ioi all targets is a Nash equilibrium. 
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2) If < kc^ the maximal value of u' — u corresponds to q' = 0: 

(ti 

max u — u — —iU^ — U'^ — kc) - 

o<q'<i n 

If kc — (^~^)(^~^ ) < — U'^ < kc^ u' — u < 0, it is a Nash equilibrium; otherwise, it is 

not. 

To sum up, a Nash equilibrium exists if and only if — U'^ > kc — (^~^)(^~^ ) ^ the 
equilibrium corresponds to all targets having probability 1 of being protected. □ 

Next, we characterize the optimal approximate equilibrium when no Nash equilibrium 
exists. 



Theorem 8. In the General model, in the optimal e-equilibrium all targets are proteeted 
with probability . The eorresponding e is ^ . 

^ ^ k,r -L J rn.k, 


Proof When all targets have the same coverage probability g, the expected utility of each 
defender is 

_ {U^ -U^ - nkc)q + U^ + {nk - l)n 
n 

Suppose 0 < g < 1. If some defender i increases q to q + 6ij for target then she would 
obtain utility u' = — {q + Sij)c^ and 


u — u — 


< 


n-{U^- U^)q - 


n 


J = 1 


U^)q - 


n 


(1) 


Now we consider scenarios in which a defender i could obtain higher utility by decreasing 
protection probability. We claim that if a defender i has an incentive to deviate, it is 
optimal for this defender to use the same coverage probability for all her targets. Otherwise, 
for some target pk which has probability 0 of being attacked, she could decrease to 
obtain higher utility. Thus, we need only consider cases in which a defender deviates 
by decreasing coverage probabilities for all her targets to q — 6. Her utility will become 
u" — {U^ — U'^ — kc){q — 5) + U'^ + {k — 1)12. Since — U'^ < kc^ 5 = q (the maximal value 
of 5) maximizes u" — u: 


max u" — u = 
0<5<g 


- U^)q 
nk 


Uu 


+ kcq + u^-n. 


( 2 ) 


By comparing the value of equation (1) and equation (2), we get different values of e for 
e-equilibrium: 

._J n ’ ^ - kc ^ 

M +1/"-n, 

When q = , we get the minimal e = (^-L . 
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We claim that the ^ -equilibrium can appear only if all targets have 

the same coverage probability q. We prove this by contradiction. Suppose that targets 
have different coverage probabilities. This gives rise to two cases: 1) Each defender uses an 
identical coverage probability for each target she owns (these may differ between defenders); 
and 2 ) Some defender has different coverage probabilities for her targets. In case 1), there 
exist /3 defenders (1 < /3 < n) who have the same minimal coverage probability q'. The 
expected utility for each defender among these (3 is 

me _iju_ 

”=- p -^ 

When < 1, some defender i among these /3 could decrease the coverage proba¬ 
bility of all her targets to 0 and obtain the utility of ui = + {k — 1 ) 12 , so that 


n-{U^-U^)q' -U^ , ; rru 

R-x — ^ -“1“ kcq U — 12 

n-{U^-U^)q' -U^ , ; rru 

> -T kcq U — 12. 

n 

When 0 < q' < ? some defender i among these /3 can increase coverage probabilities 

of all her targets to q' + ^3 to obtain utility of U 2 — A:12 — k{q' + ^ 3 ) 0 , with 


U 2 - u = 


> 


n-{U^- U^)q' -U^ - k^cSs 

U^)q' - 
n 


where the inequality holds because ^3 can be arbitrarily small. Thus, no profile in case 1) 
can be a ^ -equilibrium. In case 2), any defender who has different coverage 

probabilities among her targets can always increase her payoff by decreasing the coverage 
probabilities of the targets with higher coverage to yield identical coverage for all targets. 
Consequently, no profile in case 2 ) can be a ^ -equilibrium. □ 


As the final step towards characterizing the Price of Anarchy, we derive optimal social 
welfare in this model. 


Theorem 9. In the General model, the optimal soeial welfare SWq is 

(u^ — nkc + {nk — 1 ) 12 , if > nkc; 

SWo — \ 

[U^ + {n- 1)12, if U^-U^ < nkc. 

Proof sketch. We firstly claim that we could get optimal social welfare only if all targets 
have the same coverage probability q. Otherwise, some target tij has probability 0 of being 
attacked, and we can decrease qij to improve social welfare. Consequently, we need only 
to consider an optimal symmetric coverage probability q to maximize social welfare, which 
can be done in a manner similar to that for the baseline case. □ 
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Number of Targets Each Defender Has 


Figure 5: (Approximate) Price of Anarchy when c = 1, Ti = —1, = —2 and = —10 


If — U'^ > kc— ^ ^ Nash equilibrium is unique, with all targets protected 

with probability 1. The corresponding social welfare is 

SWe = U^- nkc + {nk - l)fl. 


So far we have not yet added any constrains to value of fl, and U'^ (except that 
Q >U^ > f/^). In order to make Price of Anarchy well-defined, we need to add constraints 
that values of fl, and U'^ are all non-positive (just as in the previous two models) or all 
non-negative. To be consistent with previous models, we add constraints that and 

n are all non-positive (little changes if all are non-negative). 

In the case of a unique Nash equilibrium, the price of anarchy is 


PoA = < 


1 , 

U^-U'^-nkc 

+ 1)0 


+ I5 


it U‘-U'‘> nkc-, 

ifS,^_(nzlX5z£!l< 

n — 

pc-u^ < 


If — U'^ < kc — ^ ^ there is no Nash equilibrium. The Social Welfare in the 

optimal approximate equilibrium is 

O — TI^ 

SWe = (U^ -U^- nkc)— - + U^ + (nk - 1)J^, 

kc 

and the + 1- 

We now analyze the relationship between (e-)PoA and the values of n and k. Here are 
the key differences from the Multi-Target Model. First we consider (e-)PoA as the function 
of n. If = 0, the result is same as that in the Multi-Target Model: (e-)PoA linearly 
increases in n, and is therefore unbounded. However, if Ti ^ 0, while PoA and e-PoA are 
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U^-Q 


, respectively. In other words, 


increasing in n, as n ^ oo, they approach 1 — ^ and 1H—^ 

PoA (exact and approximate) is bounded by a constant, for a constant k\ 

Consider now approximate price of anarchy as a function of k. If = 0, it is bounded by 
n + 1. However, if ^ 0, when kc— ) < jjc _-uu^ jg increasing function of k. 


When kc — 


(n-l)(0-U^) 


> — it may at first increase or decrease in A;, depending on the 


the values of the model parameters. However, when k is large enough, price of anarchy will 
invariably be decreasing in A;, and as A; ^ oo, e-PoA ^ 1. Figure 5 provides an example of 
the relationship between e-PoA and k. Observe that all the curves begin to decrease when 
k > 10, and they all approach 1 as A; ^ oo. Thus, price of anarchy in the general model is 
only unbounded in the special case when = 0, whereas when ^ 0, price of anarchy is 
always bounded by a constant. This observation is particularly surprising and significant 
considering the fact that the baseline and simplified multi-target models are quite natural, 
and seemingly innocuous, restrictions of the general case. 


5. Analysis of Interdependent Multi-defender Security Games 

We now develop and analyze a computational framework for approximating Nash equilibria 
in interdependent multi-defender security games. A crucial step in computing (or approx¬ 
imating) a Nash equilibrium of a game is to consider the problem of computing a best 
response for an arbitrary player (in our case, defender, since the attacker’s best response is 
straightforward). Next, we develop a novel mixed-integer linear programming formulation 
for computing ASE best response, and then propose a hightly effective heuristic method for 
approximating ASE in multi-defender games. 

5.1 Computing Defender Best Response: A Mixed-Integer Linear 
Programming Formulation 

While ASE seems a very natural alternative to SSE even in two-player security games, we 
are not aware of any proposals for computing it. Below, in equations 3-13, we present the 
first (to our knowledge) mixed-integer linear programming formulation for computing ASE 
which, in our case, would compute a best response for an arbitrary defender i when the 
strategies of all other players, q-i^ are fixed. 


max u — > > 

a,qi,s,u,v ^ ^ ^ 

j^Ti oGO 


(3) 

S.t. 
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o 

V j e 

(9) 

o 

y j^Ti 

(10) 

Q 

V j E T.i 

(11) 

aj Adsj ^ 1 

y JET 

(12) 

U = f{q,a), 


(13) 


where M is a very large number and 


f{Q,a) = 


T.jeTi EoeO + Y.j 




Eoso ^—i,j y 


E 


jeT ^3 


While constraint 13 is non-linear, we can linearize it using McCormick inequalities. Con¬ 
straints 4 and 5 ensure that the defender’s strategy is a valid probability distribution. 
Constraint 7 ensures that at least one target is chosen by the attacker. Constraints 8 and 9 
compute the optimal attacker utility u; alone, they ensure that this utility corresponds to 
some attack target. Constraints 10 and 11 compute an auxiliary variable which is 0 if 
and only if attacking a target j yields an optimal utility to the attacker. These variables, 
together with constraints 12 and 8-9 ensure that the binary variable aj = 1 if and only if 
the attacker (weakly) prefers to attack target j; that is, these jointly compute the set of 
optimal attack targets. Finally, constraint 13 computes the expected utility to the defender 
if the attacker chooses one of his most preferred targets uniformly at random. 

If M is infinite and numbers can be computed to arbitrary precision, the above for¬ 
mulation is correct. In practice, of course, numerical precision and stability are an issue, 
and they arise with this formulation. Consider constraints 10 and 11, which compute sj 
for all targets j. These require that the expected value of the target exactly equal the 
optimal attacker utility computed in constraints 8-9; even a slight error will technically 
violate our requirement that Sj = 0 at an optimal target. Moreover, even if the difference 
is, indeed, non-zero, from an attacker’s perspective it seems intuitive that sufficiently small 
differences from optimal utility are ignored. We address these problems by adding a fixed 
small quantity 6 to the right-hand-side of constraints 8, 9, while subtracting it from the 
right-hand-side of constraint 12. Because the constraints are interrelated, we cannot simply 
choose an arbitrary 0 < 5 < 1, but must ensure that 6 and M satisfy the constraint that 
M6 <1-6. 


5.2 Approximating ASE 

Previously, Vorobeychik and Wellman (2008) presented a convergent equilibrium approxi¬ 
mation algorithm based on simulated annealing (SA) that would be applicable in our setting. 
They additionally showed in simulation that SA is actually outperformed by a simple heuris¬ 
tic based on iterated best response (IBR) dynamics. Here, we interpret IBR as a local search 
heuristic, with the property that if the starting point is a Nash equilibrium, IBR will never 
deviate from it (i.e., Nash equilibrium is a fixed point). Clearly, then, the choice of a starting 
point can be significant for the performance of IBR, making it natural to consider coupling 
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it with random restarts. Our main contribution in this section is to present evidence that 
IBR with random restarts is a highly effective equilibrium approximation approach in our 
setting (and outperforms several alternatives). This is both of broad significance, and of 
particular importance in our setting, as we use this algorithm for our analyses below. 




Figure 6: Comparison of algorithms. Left: \D\ 
\T\ = 20. 


2 and \T\ = 10. Right: \D\ = 5 and 


We compare the following Nash equilibrium approximation algorithms executed for 1000 
iterations: random search (RS), which simply generates 1000 strategy profiles randomly, 
computes the game theoretic regret of each, and chooses a profile with the smallest regret; 
simulated annealing (SA), with the temperature exponentially increasing with iterations; 
and iterated best response (IBR) with no restarts. We also include in the comparison two 
additional variations of IBR: the first uses SA for the first 100 iterations, and then switches 
to IBR for the remainder (starting with the best approximation produced by SA); the second 
is IBR with random restarts, which we term RIBR. RIBR includes initial corner cases that 
may be hard to converge to in a limited amount of time (i.e., all defenders not defending, 
all defenders defending completely). We execute our comparison on games with 2 players 
and 10 targets and games with 5 players and 20 targets. In all cases, targets are divided 
evenly among the players, and values over the targets are generated uniformly at random. 
The cost of defense is fixed at c = 0.2, and the targets are assumed to be independent 
(but players may have values for targets under the control of other defenders). Figure 6 
demonstrate that in both settings, RIBR outperforms other alternatives. 

5.3 Analysis of Multi-Defender Games on Synthetic Networks 

For our first set of experiments, we use RIBR on 3 artificially generated networks, with 40 
samples for each parameter variation. First, we will illustrate and compare the results of our 
interdependent multi-defender game on artificial networks. We use 3 commonly analyzed 
network structures: a grid, Erdos-Renyi networks, and preferential attachment networks. 
In all of the generated networks, there are 64 nodes or targets. For the latter two, we use the 
Metis graph partitioning software to partition the nodes (targets) among defenders. This 
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Grid Erdos-Renyi Preferential Attachment 








H |p=0.1 I— |p=0.2 H |p=0.3 Hp=0.4 Hp= 0.5 t-|p=0.6 h-lp=0.7 F-[p=0.8 f-Ip=0.9 


Figure 7: Approximate equilibrium security outcomes for varying cascade probability p, as 
a function of (log of) the number of players (level of decentralization) for the 
three synthetic networks. Top: Social welfare. Bottom: Average strategy (higher 
strategy corresponds to higher average probability of defense). 


software partitions nodes to minimize connectivity among the targets belonging to different 
defenders, a property that we expect to common hold in real networks due to efficiency 
considerations. 

We begin by considering average strategies, as well as social welfare, for the three differ¬ 
ent synthetic networks (grid, Erdos-Renyi, and preferential attachment), as a function of the 
number of players (degree of decentralization) and the cascade probability (interdependent 
risk). The results are shown in Figure 7. We do not show these as a function of defense 
cost as increasing defense cost roughly mirrors decreasing cascade probability p. The first 
rather stark observation is that network structure makes little difference when each node is 
controlled by a single player, but it makes a significant qualitative difference both for social 
welfare and actual strategies utilized by the players in all other cases. 

Looking at the results in greater detail, let’s consider first social welfare (Figure 7, top). 
First, when interdependent risk is low (0.1 < p < 0.3), social welfare follows a relatively sim¬ 
ple pattern: increasing decentralization makes initially almost no difference, until sufficiently 
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Power Network 1 


Power Network 2 


Power Network 3 








H |p=0.1 I— |p=0.2 Hp=0.3 Hp=0.4 Hp= 0.5 t-|p=0.6 h-lp=0.7 F-[p=0.8 b-Ip=0.9 


Figure 8: Approximate equilibrium security outcomes for varying cascade probability p, as a 
function of (log of) the number of players (level of decentralization) for the three 
real power networks. Top: Social welfare. Bottom: Average strategy (higher 
strategy corresponds to higher average probability of defense). 


many players are involved, at which point social welfare falls rather dramatically; this pat¬ 
tern is roughly monotonic with increasing decentralization, with worst outcomes emerging 
when each player controls a single node, and mirrors previous findings (Vorobeychik et ah, 
2011). Both Erdos-Renyi and preferential attachment networks are less susceptible to the 
negative effects of decentralization in this case than the grid network, where the dropoff 
occurs with fewer players (less decentralization). This may be largely a consequence of the 
fact that network partitioning tools we use attempt to minimize interdependence among 
players—something that is likely to mirror reality—and far more opportunities for doing so 
exist in Erdos-Renyi and preferential attachment models. 

When p is higher (greater interdependencies), the results exhibit an entirely new phe¬ 
nomenology. Across all three network models, for a sufficiently large p, the impact of 
decentralization is non-monotonic: an intermediate level of decentralization has the most 
detrimental impact on security, while a highly decentralized system becomes near-optimal! 
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1 player 4 players 64 players 



Figure 9: Strategic realizations for representative games on grid and distribution network 
topologies. Top: Grid, p = 0.1, middle: Grid, p = 0.4, bottom: Power Network, 
p — 0.7. Darker node colors indicate higher probability of defense (coverage); 
so white implies no coverage and black implies that the node is always covered 
(defended). Dotted red lines indicate the partition of nodes among players (except 
in the rightmost case, when each player controls a single node). 


Investigating actual (average) strategic decisions by the players yields deeper insights 
into the findings above. When interdependencies are weak, optimal decision is to invest 
relatively little in security, in any generative model. Increasing decentralization, therefore, 
gives rise primarily to over-investment, mirroring our analytical results for the limiting 
case when targets were independent, although the tendency to over-invest is quite weak 
until the network is extremely decentralized, except in the grid network. When p is high, 
on the other hand, the predominant phenomenon is underinvestment. This in itself is not 
surprising: after all, a high level of interdependencies should imply that positive externalities 
of security should be dominant. What is surprising is, again, non-monotonicity in the 
level of decentralization: when decentralization is moderate, underinvestment can be quite 
dramatic. On the other hand, a high level of decentralization often appears to dull this 
effect, and the level of investment in security becomes much closer to optimal. 
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p = 0.1 


p = 0.4 


p = 0.7 



Figure 10: Node security and network (closeness) centrality; each defender controls 4 nodes. 


5.4 Results on power grid networks 

The grid network studied above is arguably the most “artificial”, in the sense that both 
Erdos-Renyi and preferential attachment models were developed in part to resemble real 
networks (this is particularly true of the latter, which aims to replicate the scale-free prop¬ 
erties of observed networks). Surprisingly, however, the approximate equilibrium results 
applied to three snippets of actual power networks most resemble the phenomena observed 
for the grid, as can be seen in Figure 8. In particular, just as in the grid above, over¬ 
investment in security appears to dominate, even at relatively high levels of interdepen¬ 
dence, but only when decentralization is significant, while most levels of decentralization 
are relatively robustly near-optimal (these are, in fact, more robust to decentralization than 
the grid network above). 

To dig somewhat deeper into the rather complex phenomenology we have observed. Fig¬ 
ure 9 shows several examples of actual strategy realizations. First, consider the top series of 
plots for the grid with cascade probability p = 0.1. As previously described, we can clearly 
see that the optimal security configuration involves no security investment (leftmost grid), 
whereas an increasing level of decentralization gives rise to increased security investment, 
culminating, ultimately, with full protection in the extreme level of decentralization. The 
contrast between the two extremes offers some guidance: even though optimal global con¬ 
figuration involves no security, when each player controls (and cares about) only a single 
node, the best response of an attacked node is to defend it just enough to force the attacker 
to attack another; for example, slightly more than the next weakest node. Iterating on 
this idea, strategies “cascade” to full defense. When the player controls more than one 
node, however, there is suddenly strategic tension: higher security on one node may well 
push the attacker to attack another node under this player’s control. Positive externalities 
become more significant as well: pushing the attacker to attack another node “nearby” is 
likely to gain little when cascade probabilities are high and multiple nodes owned by the 
defender could be affected. For sufficiently high cascade probabilities, and sufficiently low 
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number of players, such positive network effects can actually sway players to under-invest 
in security, as we can see both in the middle and last rows of Figure 9 (the 4-player case). 
Here, strategic complementarities make security investment not worthwhile in equilibrium: 
the nodes that need to be defended are relatively central, and cut across different players 
(i.e., the critical central nodes create a kind of “buffer” between defenders). This behavior 
diminishes as decentralization increases. 

5.5 Security and Network Centrality 

Finally, we consider the relationship between network centrality and strategic choice of a 
node. The results, shown in Figure 10, plot each node’s closeness centrality and correspond¬ 
ing security across 10 instances for each network. When the interdependence of networks 
is low, security investment appears to correlate rather strongly with closeness in synthetic 
networks: in other words, nodes more central in the network invest more in security. A 
similar relationship is seen with node degrees. This correlation, however, largely disappears 
with greater interdependence, likely because the difference between being one-hop vs. two- 
hops away from an attacked node (i.e., being a low-connected neighbor of a high-connected 
node) becomes considerably less in such a case. 

For the power networks, the correlation between centrality and strategy remains even 
with higher cascade probabilities. This relationship can be seen by comparing the strategy 
profile in the second column of Figure 9 and the closeness plot for p = 0.7 for the power 
network. The structure of the power network has several small ’’chains” that have low 
centrality. The node that connects the chain to the rest of the network (higher centrality) 
incurs most of the defense cost, since a failure cascade would have to pass through this node 
to reach the rest of the chain. As the amount of interdependence increases (column 3 in 
Figure 9), these chains become partitioned between defenders, weakening the relationship 
between centrality and strategy. 

6. Conclusion 

In this work, we have extended the current state of Stackelberg security games to include 
multiple defenders in non-cooperative scenarios with independent and interdependent tar¬ 
gets. 

For the independent case, we provided complete characterizations of Nash and approx¬ 
imate equilibria, socially optimal solutions, and price of anarchy (PoA) for three models of 
varying generality. Our analysis showed that defenders generally over protect the targets, 
but different modelling assumptions give rise to qualitatively different outcomes: a simpler 
model gives rise to an unbounded PoA, whereas a more general model sees PoA converge 
to a constant when the number of defenders increases. 

For the interdependent case, we developed a novel computation framework to overcome 
the difficulties of providing a concise formal analysis of such a complex model. Our sim¬ 
ulations characterize a broad space of strategic predicaments, varying cascade probability, 
network structure, and system decentralization. In contrast to the independent models, our 
results show differing behavior in terms of security investment dependent on the strength 
and structure of interdependencies. One of our most stark findings is the non-monotonicity 
of welfare and strategic choices as a function of the number of players: in a number of 
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cases, higher levels of decentralization become near-optimal, even while intermediate de¬ 
centralization leads to very poor outcomes. As security decisions are almost universally 
decentralized, and often highly interdependent, our findings enable a deeper understanding 
of practical security considerations, highlighting the importance of both^ over- and under¬ 
investment in security, and the dependence of each on network structure, the magnitude of 
network externalities, and the level of decentralization. Finally, we have shown how security 
behavior in our model on real-world power networks relates to those in synthetic networks, 
highlighting similar behaviors with grid networks, and unveiling structural differences with 
Erdos-Renyi and preferential attachment networks using the relationship between strategy 
and centrality. 
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Appendix 

Theorem 4. In the Multi-Target model, Nash equilibrium exists if and only if u > kc. In 
this equilibrium all targets are protected with probability 1. 

Proof. We firstly claim that a Nash Equilibrium must have identical coverage probabilities 
for all targets tij. Otherwise, there will be a target tik which has the probability 0 of being 
attacked, and defender i has an incentive to decrease qik. 

When all targets have the same coverage probability g, the expected utility of each 
defender is 

(v — cnk)q — v 

u — -. 

n 

If g < 1, then some defender i can increase q to q -\- 6 for all of her targets to make sure 
none are attacked, obtaining utility u' — —k{q -\- 5)c, with 

f v{l — q) — nkc6 


As 6 can be arbitrarily small, u' — u > 0 when ^ < 1, and it cannot be a Nash equilibrium. 

When all targets have the same coverage probability q — the expected utility of each 
defender is u = —kc. If defender i want to deviate, then one of her targets will be attacked. 
Suppose this target is with qij. < 1. Then her expected utility is 

k 

u' = -V + qir{v - c) + {-QijC), 

j=ljjr 


and 


k 

u'- u= {v - c){qir-l) + {-qijc) + [k - l)c. 


We now have two cases: 
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1) If V < then {v — c){qir — 1) > 0 and + {k — l)c > 0, and the defender 

has an incentive to deviate, and there is no Nash equilibrium. 

2) If V > then, because qij > qir for all j ^ r, 

u' — u < {v — c){qir — 1) + {k — l)c — {k — l)qirC 


and 


u 


u < {qir — 1)(^’ — kc). 


In case 2, therefore, u' — u < 0 iS v > kc^ which yields the desired result. 


□ 


Theorem 5. In the Multi-Target model, if v < kc, then in the optimal e-equilihrium all 
targets are proteeted with probability The eorresponding e is ♦ 

Proof. When all defenders have the same coverage probability q, the expected utility of 
each defender is 

[v — cnk)q — v 


u 


n 


Suppose 0 < q < 1. If some defender i increases q to q + 6ij for target Uj, then she would 
obtain a utility of 


u — u — 


v{l - q) 


n 


-E^ 


c < 


v{l - q) 


n 


We claim that if a defender i has an incentive to deviate, it is optimal for this defender 
to use the same coverage probability for all her targets. Otherwise, for some target tik 
which has probability 0 of being attacked, she could decrease q'-j^ to obtain higher utility. 
Therefore, we need only consider cases when a defender deviates by decreasing probabilities 
of all her targets to q — 6. Then her utility is u" = {v — kc){q — 6) — v, and 


u — u = 


6n{kc — v) + v{n — l){q — 1) 


n 


Since v < kc when 6 — q (the maximal value of 6), the maximal value of u" — u is 

v{l-q) 


max u — u = 
0<S<q 


n 


+ kcq — V. 


Let di = , d 2 = + kcq — v. Then, 

di — d2 = —kcq + V 

When ^ di > ^2, and it is a 1^^^-Nash Equilibrium; when q > di < d 2 j and it is 


a ( 


vp-q) 


+ kcq — '?;)-Nash Equilibrium. To sum up, for e-Nash Equilibrium, 


e = 


' v{l-q) 
n 

n 


if0<g<^; 
+ kcq-v, if ^ < g < 1. 


30 



Multidefender Security Games 


When (7 = 7 ^, we obtain the minimal e = ^ 

We claim that the ^^^^^^^ -Nash Equilibrium can appear only if all targets have the 
same coverage probability q. Suppose targets have different coverage probabilities. This 
implies two possibilities: 1) Each defender uses identical coverage probabilities for her 
targets, and differences exist only between defenders; 2 ) Some defender uses diverse coverage 
probabilities among her own targets. 

We first consider case 1), in which targets may have different probabilities of bing 
protected, but each defender has the same probability of protecting her own targets. In this 
case there exist /3 defenders {1 < (3 < n) who have the same minimal coverage probability 
q'. The expected utility for each defender among these (3 defenders is: 


Up = 


(u — kcj3)q' — V 

^ ‘ 


When ^ < q' <1^ some defender i among these a defenders can decrease probability of all 
her targets to 0 to obtain a utility of ui — —u, with 

v{l-q') , v{l-q') , 

ui-Ue = - —- + {kcq -v)> - - + {kcq - v). 


/3 


m 


When 0 < g' < ^, some defender i among these /3 defenders can = increase probability of 
all her targets to q' + ^3 to obtain a utility of U 2 — —k{q' + ^ 3 ) 0 , with 

v(l — q') — kcl36s v(l — q') 

U2-Ue = --- > -, 

p n 

where inequation holds because ^3 can be arbitrarily small. Consequently, case 1 cannot 
have a ^^^^^^-Nash Equilibrium. 

Now we consider case 2), in which there exists a defender who uses different coverage 
probabilities among her targets. As some of her targets have probability 0 of being attacked, 
she could increase her payoff by decreasing probabilities of all of these targets to be as small 
as her target with the lowest coverage probability. It means that each coverage profile in 
case 2 can always be transformed to a coverage profile in case 1 in a way that improves a 
defender’s utility without hurting anyone else. This means that case 2 also cannot have a 
^^^^^j^-Nash Equilibrium. □ 

Theorem 6. In the Multi-Target model, the optimal soeial welfare SWq is 


SWo 


—cnk^ ifv>cnk; 
—u, if V < cnk. 


Proof sketeh. We firstly claim that we could obtain optimal social welfare only if all targets 
have the same coverage probability q. Otherwise, some target tp has the probability 0 of 
being attacked, and we can decrease gp to improve social welfare. Consequently, we need 
only to consider an optimal identical coverage probability q to obtain optimal social welfare, 
which can be done in a way similar to the baseline case. □ 
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